Attention Optimization, Memory Efficiency, Transformer Acceleration, IO-Aware

MiniMax pre-training lead explains why they didn't use linear attention
reddit.com·2d·
Discuss: r/LocalLLaMA
👁️Attention Optimization
Stable Video Infinity: Infinite-Length Video Generation with Error Recycling
paperium.net·12h·
Discuss: DEV
📊Gradient Accumulation
Specialized structure of neural population codes in parietal cortex outputs
nature.com·1d
🧩Attention Kernels
Microstutter in games? Your RGB software might be why
howtogeek.com·3h
📈Occupancy Optimization
🧠 Soft Architecture (Part B): Emotional Timers and the Code of Care (Part 5 of the SaijinOS series)
dev.to·6h·
Discuss: DEV
🤖AI Coding Tools
Everything About Transformers
krupadave.com·2d
🧩Attention Kernels
Your Transformer is Secretly an EOT Solver
elonlit.com·1d·
Discuss: Hacker News
👁️Attention Optimization
Show HN: Hot or Slop – Visual Turing test on how well humans detect AI images
hotorslop.com·1d·
Discuss: Hacker News
👁️Attention Optimization
Sparse Adaptive Attention “MoE”: How I Solved OpenAI’s $650B Problem With a £700 GPU
medium.com·4d·
🧩Attention Kernels
Cycle-accurate 6502 emulator as a coroutine in Rust
github.com·4h·
Discuss: Hacker News
📊Profiling Tools
Dual-format attentional template during preparation in human visual cortex
elifesciences.org·3d
🧩Attention Kernels
Accelerating AI inferencing with external KV Cache on Managed Lustre
cloud.google.com·1d
ONNX Runtime
TinyML is the most impressive piece of software you can run on any ESP32
xda-developers.com·1d
ONNX Runtime
EP187: Why is DeepSeek-OCR such a BIG DEAL?
blog.bytebytego.com·3h
🤖AI Coding Tools
😺 🎙️ Adobe’s CTO: How AI will end creative “grunt work”
theneurondaily.com·1d
🤖AI Coding Tools
An underqualified reading list about the transformer architecture
fvictorio.github.io·2d·
Discuss: Hacker News
🧩Attention Kernels
Contribution-Guided Asymmetric Learning for Robust Multimodal Fusion under Imbalance and Noise
arxiv.org·1d
📉Model Quantization
RF-DETR Under the Hood: Insights into Real-Time Transformer Detection
towardsdatascience.com·1d
👁️Attention Optimization
A Minimal Route to Transformer Attention
neelsomaniblog.com·2d·
Discuss: Hacker News
🧩Attention Kernels
Clarity From Chaos: AI Super-Resolution Redefined
dev.to·12h·
Discuss: DEV
👁️Attention Optimization